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Promoter Reporter Construct 






A = pGRN 1 44 
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181 GGACCCGGCGGCTTTCCGCGCGCTGGTGGCCCAGTGCCTGGTGTGCGTGCCCTGGGACGC 
CCTGGGCCGCCGAAAGGCGCGCGACCACCGGGTCACGGACCACACGCACGGGACCCTGCG 

NFkB_CSl 
GGGRQTYYQC 
NFkB-MHC-1.2 
TGGGCTTCCCC 

241 ACGGCCGCCCCCCGCCGCCCCCTCCTTCCGCCAGGTGGGCCTCCCCGGGGTCGGCGTCCG 
TGCCGGCGGGGGGCGGCGGGGGAGGAAGGCGGTCCACCCGGAGGGGCCCCAGCCGCAGGC 

Intronl 

301 GCTGGGGTTGAGGGCGGCCGGGGGGAACCAGCGACATGCGGAGAGCAGCGCAGGCGACTC 
CGACCCCAACTCCCGCCGGCCCCCCTTGGTCGCTGTACGCCTCTCGTCGCGTCCGCTGAG 

NFkB_CSl 
GGGRQTYYQC 
NFkB_CS2 
RGGGRMTYYCC 

Topo_I I_c 1 eavage_s i t e 
RNYNNCNNGYNGKTNYNY 

361 AGGGCGCTTCCCCCGCAGGTGTCCTGCCTGAAGGAGCTGGTGGCCCGAGTGCTGCAGAGG 
TCCCGCGAAGGGGGCGTCCACAGGACGGACTTCCTCGACCACCGGGCTCACGACGTCTCC 
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I AAAACCCCAA AACCCCAAAA CCCCTTTT.AG AGCCCTGCAG TTGGAAATAT 
5 1 AACCTCAGTA TTAATAAGCT CAGATTTTAA ATATTAATTA CAAAACCTAA 
101 ATGGAGGTTG ATGTTGATAA TCAAGCTGAT AATCATGGCA TTCACTCAGC 
1 5 1 TCTTAAGACT TGTG.AAGAAA TTAAAGAAGC TAAAACGTTG TACTCTTGGA 
20 1 TCCAGAAAGT TATTAGATGA AGAAATCAAT CTCAAAGTCA ttataaagat 

25 1 ttagaagata ttaaaatatt tgcgcagaca aatattgttg ctactccacg 
30 1 agactataat gaagaagatt ttaaagttat tgcaagaaaa gaagtatttt 

3 5 1 caactggact AATG ATCG AA cttattg aca aatgcttagt TGAACTTCTT 

40 1 TCATCAAGCG ATGTTTCAGA TAGACAAAAA CTTCAATGAT TTGGATTTCA 

4 5 1 ACTTAAGGGA AATCAATTAG CAAAGACCCA TTTATTAACA GCTCTTTCAA 
50 1 CTCAAAAGCA GTATTTCTTT CAAGACGAAT GGAACCAAGT TAGAGCAATG 
551 a-ttcgaaatg AGCTCTTCCG ACATCTCTAC ACTAAATATT taatattcca 
60 1 gcgaacttct gaaggaactc ttgttcaatt TTGCGGGAAT aacgtttttg 
65 1 atcatttgaa agtcaacgat aagtttgaca aaaagcaaaa aggtggagca 
70 i gcagacatga ATGAACCTCG ATGTTGATCA acctgcaaat ACAATGTCAA 
751 GAATGAGAAA. GATCACTTTC TCAACAACAT CAACGTGCCG AATTGGAATA 
80 1 ATATGAAATC A.AGAACCAGA ATATTTTATT GCACTCATTT TAATAGAAAT 
85 1 AACCAATTCT TCAAAAAGCA TGAGTTTGTG AGTAACAAAA ACAATATTTC 
901 AGCGATGGAC AGAGCTCAGA CGATATTCAC GAATATATTC AGATTTAATA 
95 I GAATTAGAAA GAAGCTAAAA GATAAGGTTA TCGAAAAAAT tgcctacatg 

1 00 1 cttgagaaag tcaaagattt ta acttcaac tactatttaa caaaatcttg 

1 05 1 TCCTCTTCCA GAAAATTGGC GGGAACGGAA ACAAAAAATC GAAAACTTGA 

1101 TAAATAAAAC TAGAGAAGAA AAGTCGAAGT ACTATGAAGA gctgtttagc 

i 15 1 tacacaactg ataataaatg CGTCACACAA tttattaatg aatttttcta 

I 20 1 caatatactc CCCAAAGACT ttttg actgg aagaaaccgt aagaattttc 

1251 AAAAGAAAGT TAAGAAATAT GTGGAACTAA ACAAGCATGA ACTCATTCAC 
I 30 1 AAAAACTTAT TGCTTGAGAA GATCAATACA AGAGAAATAT CATGGATGCA 
1351 GGTTGAGACC TCTGCAAAGC ATTTTTATTA TTTTGATCAC GAAAACATCT 

1 40 1 ACGTCTTATG ga aattg ctc cgatggatat tcgaggatct cgtcgtctcg 
14 51 CTGATTAGaT G ATTTTTCTA TGTC ACCG AG caacagaaaa gttactccaa 
i 50 I aacctattac tacagaaaga atatttggga cgtcattatg aaaatgtcaa 

155 1 TCGCAGACTT AAAGAAGGAA ACCCTTGCTG AGGTCCAAGA AAAAGAGGTT 
1 60 1 GaaGAATGGA AAAAGTCGCT TGGATTTGCA CCTGGAAAAC TCAGACTAAT 
1 65 1 ACCGAAGAAA ACTACTTTCC GTCCAATTAT GaCTTTCAAT AAGAAGATTG 
I 70 1 TAAATTCAGA CCGGAAGACT ACAAAATTAA ctacaaatac caagttattg 
1751 AACTCTCACT TAATGCTTAA GaCATTG AAG AATAGAATGT TTAAAGaTCC 

1801 ttttggattc gctgttttta actatgatga tgtaatgaaa aagtatcagg 

1 85 1 agtttgtttg CAAATGGAAG CAAGTTGGAC AACCAAAACT CTTCTTTGCA 

1 90 1 ACTATGGATA TCGAAAAGTG ATATGATAGT GTAAACAGAG AAAAACTATC 
1 95 I AACATTCCTA AAAACTACTA AATTACTTTC TTCAGATTTC TGGATTATGA 
2001 CTGCACAAAT TCTAAAGAGA AAGAATAACA TAGTTATCGA TTCGAAAAAC 
205 1 TTTAGAAAGA AAGAAATGAA AGATTATTTT AGACAGAAAT TCCAGAAGAT 
2101 TGCACTTGAA GGAGGACAAT ATCCAACCTT ATTCAGTGTT CTTGAAAATG 

2 I 5 1 AACAAAATGA CTTAAATGCA AAGAAAACAT TAATTGTTGA AGCAAAGCAA 
220 I AGAAATTATT TTAAGAAAGA TAACTTACTT CAACCAGTCA TTAATATTTG 
225 I CCAATATAAT TACATTAACT TTAATGGGA A G I I 1 1 ATAAA CAAACAAAAG 
2301 GAATTCCTCA AGGTCTTTGA GT-rrCATCAA ttttgtcatc attttattat 
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2j5 1 GCAACATTAG AGGAAAGCTC CTTAGGATTC CTTAGAGATG AATCAATGAA 
240 1 CCCTGAA AAT CCAAATGTTA ATCTTCTAAT GAGACTTACA GATGACTATC 
245 TTTTGATTAC AACTCAAGAG AATAATGCAG TATTGrTTAT TGAGAAACTT 
IIV, ATAAACGTAA GTCGTGAAAA TGGATTTAAA TTCAATATGA AGAAACTACA 
III ^i^I^^"^ CCATTAAGTC CAAGCAAATT TGCAAAATAC GGAATGGATA 
260 GTGTTGAGGA GCAAAATATT GTTCAAGATT ACTGCGATTG GATTGGCATC 
.nl ^J^ttgata TGAAAACTCT TGCTTTAATG CCAAATATTA ACTTGAGAAT 
275 1 r^l^^.?^^^ ^^^J^'^t^'^^ T.^'^^'^^j^A^ CATGCAAACA AAGAAAGCAT 
2801 
2851 
2901 
2951 
3001 
3051 
3101 
3I5I 
320 



CAAT GTGGC T CAAGAAGAAA CTAAAGTCGT ttttaatgaa taacattacc 
cattatttta gaaagacgat tacaaccgaa gactttgcga ataaaactct 
caacaagtta tttatatcag gcggttacaa atacatgcaa tgagccaaag 
aatacaagga ccactttaag aagaacttag ctatgagcag tatgatcgac 
ttagaggtat ctaaaattat atactctgta accagagcattctttaaata 
nSIS^^*" AATATTAAGG atacaatttt tggagaggag cattatccag 

ACTTnrCCTTAGCACACTG AAGCACTTTA TTGAAATATT CAGCACAAAA 
aagtacattt TCAACAGAGT TTGCATGATC CTCAAGGCAA AAGAAGCAAA 
GCTAAAAAGT GA CCAA TGTC AATCTCTAAT TCAATATGAT GCATAGTCGA 
CTATTCTAAC TTA TTTTG GA AAGTTAATTT TCAATTTTTG TCTTATATAC 



3251 TGGGGTTTTG GGGTTTTGGG GTTTTGGGG 
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1 (VfEVDVDNQAD NHGIHSALKT CEEIKEAKTL YSWTQKVIRC RNQSQSHYJCD 
51 LEDIKIFAQT NIVATPRDYN EEDFKVIARK EVFSTGLMIE LIDKCLVELL 
101 SSSDVSDRQK LQCFGFQLKG NOLAKTHLLT ALSTQKQYFF QDEWNQVRAM 
151 IGNELFRHLY TKYLIFQRTS EGTLVQFCGN NVFDHLKVND KFDKKQKGGA 
201 ADMNEPRCCS TCKYNVKNEK DHFLhO^INVP NVWNMKSRTR IFYCTHFNRN 
251 NQFFKKHEFV SNKNNTSAMD RAQTIFTNIF RFNRIRKKLK DKVIEKIAYM 
301 LEKVKDFNFN YYLTKSCPLP ENWRERKQKI ENLINKTREE KSKYYEELFS 
35 1 YTTDNKCVTQ FINEFFYNIL PKDFLTGRNR KNFQKKVKKY VELNKHELIH 
401 KNLLLEKJNT REISWMQVET SAKHFYYFDH ENIYVLWKLL RWTFEDLWS 
45 1 LIRCFFYVTE QQKSYSKTYY YRKNTWDVIM KMSIADLKKE TLAEVQEKEV 
501 EEWKKSLGFA PGKLRLIPKK TTFRPIMTFN KKIVNSDRKT TKLTTNTKLL 
551 NSHLMLKTLK NRMFKDPFGF AVFNYDDVMK KYEEFVCKWK QVGQPKLFFA 
601 TMDEEKCYDS VNREKLSTFL KTnCLLSSDF WIMTAQILKR KNNTVIDSKN 
651 FRKKEMKDYF RQKFQKIALE GGQYPTLFSV LENEQNDLNA KJCTLIVEAJCO 
/Ol RNYFKKDNLL QPVrNICQYN YINFNGfCFYK QTKGIPQGLC VSSILSSFYY 
/5i ATLEESSLGF LRDESMNPEN PNVNLLMRLT DDYLLITTQE NNAVLFIEI-O, 
801 nvfVSRENGFK F^^MKKLQTSF PLSPSfCFAKY GMDSVEEQNI VQDYCDWIGI 
85 1 SIDMK TLALM PNfNLRJEGI LCTLNLNMQT KKASMWLKiCK LKSFLMNNTT 
901 HYFRKTm-E DFANKTLNKL FISGGYKYMQ CAKEYKDHFK KhfLAMSS.VtID 
95! LEVSKJIYSV TRAFFKYLVC NIICDTIFGEE HYPDFFLSTL KHFIEIFSTK 
1001 KYIFNRVCMI LKAKEAKLKS DQCQSLIQYD A 



\ 
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L ggcaccgatccaccttccccccctcacaagccaaccgccccctcgaacgcccccaaacccccggaaacactcccacaaga 80 
81 actcaataacaataccaagccaaaccccaacatgaaggcgttattagtgaccgacaacacttccactttaccggccgtca 160 
161 ccaagcacaaggacaaaaagaacaacctccctccccctiaaagacttttaccttattaacctacLcntcaaatacactccg 240 
2 41 ggttcgcctaccttcaaccgtggcactgtnttagccgctactitctagccaaccgcgtgtttictaccccgtcactiggatat 320 
321 agcncttggagcagctcacagaaacccccacaaaccctccgatgagactatattiagacccantacagcccgcgcacactc 400 
401 tcaacacggagccctacactttagacgagccacgccgcatgatggagtatttggtaccacccaacgcccgcctcgaaaag 480 
4 81 gctgacaaccacctgcaaaaccacgtcctcagcggcggtaatccgcgaaagctttttgacgctcgcacacgtccagcatg 560 
561 actgagacattcaaaaacctccacccactacaactcctttaacgcggtitttanttticctattttctactctcacgctigtt 640 
641 ccaaacatgcatcanctcgcatcaggcccccctccgctttactcctggaaccgtaccttcttcactaccccccccaatga 720 
721 acaacccaaattagntccgcttacaattgatagnagtagaaagattggcgaccccacccgcgcaacgccactagcctaaa 800 
801 gatactttgcaaaacacctatcagccatcaccacataaaaaaaatcccataattacaaatactaaccaacacctgcggcc 880 
8 81 actatctatctaaaacgccacgaccagcaggacaccrtigcatatatatagtcatgcttaatggcnacttgcaacctgc 958 

959 ATG ACC GAA CAC CAT ACC CCC AAA AGC AGG ATT CTT CGC TTT CTA GAG AAT CAA TAT GTA 1018 
IMTEHHTPKSRILRFLENQYV 20 

1019 TAG CTA TGT ACC TTA AAT GAT TAT GTA CAA CTT GTT TTG AGA GGG TCG CCG CCA AGC TCG 107 8 
21YLCTL -NDYVQLVLRGSPASS 40 

1079 TAT AGC AAT ATA TGC GAA CGC TTG AGA AGC GAT GTA CAA ACG TCC TTT TCT ATT TTT CTT 1138 
41YSNICERLRSDVQTSFSrFL 60 



113 9 CAT TCG ACT GTA GTC GGC TTC GAC AGT AAG CCA GAT GAA GGT GTT CAA TTT TCT TCT CCA 1198 
61HSTV VGFDSKPDEGVQFSSP 80 
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1199 AAA TGC TCA CAG TCA GAG gcatacacatccccgccctgacccccccccacccgggacagccaacacacqqacaa 12T> 
81 K C S 0 s E ' 

8 6 

1273 CTA ATA GCG AAT GTT GTA AAA CAG ATG TTC GAT GAA AGT TTT GAG CGT CGA AGG AAT CA 1332 
^^^^^NVVKQMFDES FERRRnl" 106 

1333 CTG ATG AAA GGG TTT TCC ATG gcaaggcactcnaaccgcgaaacacccacccgcaaccaccgccccaaagaga 1405 
107 LMKGFSM j.^"-' 

113 

1406 CCgCaCttaaccgacaaag AAT CAT GAA GAT TTT CGA GCG ATG CAT GTA AAC GGA GTA CAA AAT 1469 



N H E D F R A M H V N G V 



Q N 128 



1470 GAT CTC GTT TCT ACT TTT CCT AAT TAC CTT ATA TCT ATA CTT GAG TCA AAA AAT TGG CAA 1529 
129DLVSTFPNYLISILESKNWQ 148 



^^^^ f"^ ^"^ g"aa«^accggccaagat:gccgcgcacctrgaacaagaccgacaagcacag T ATC GGC 1601 

^ ^ ^ ^ I r G 155 

1602 AGT GAT GCC ATG CAT TAC TTA TTA TCC AAA GGA AGT ATT TTT GAG GCT CTT CCA AAT GAC 1661 
^ 156 SDAMHYLLSKGSIFEALPND 175 

aiB62 AAT TAC CTT CAG ATT TCT GGC ATA CCA CTT TTT AAA AAT AAT GTG TTT GAG GAA ACT GTG 1721 
.n 176 N Y L Q I s G I P L F K N N V F E E T V 195 

j=yli722 TCA AAA AAA AGA AAG CGA ACC ATT GAA ACA TCC ATT ACT CAA AAT AAA AGC GCC CGC AAA 1781 
,jl96S KKRKRTIETSITQNKSARK 215 

r.1782 GAA GTT TCC TGG AAT AGC ATT TCA ATT AGT AGG TTT AGC ATT TTT TAC AGG TCA TCC TAT 1841 
■216EVSWNSrsrSRFSrFYRSSY 235 

AAG AAG TTT AAG CAA G gcaaccaacaccgccacccrccacaaccaatcccag AT CTA TAT TTT AAC 

g 236 K K F K Q D . L Y F N 245 

#9;08 TTA CAC TCT ATT TGT GAT CGG AAC ACA GTA CAC ATG TGG CTT CAA TGG ATT TTT CCA AGG 1967 
%j 246 LHSICDRNTVHMWLQWIFPR 265 

1968 CAA TTT GGA CTT ATA AAC GCA TTT CAA GTG AAG CAA TTG CAC AAA GTG ATT CCA CTG GTA 2027 
^^^Q^G'-INAFQVKQLHKVIPLV 285 

2028 TCA CAG AGT ACA GTT GTG CCC AAA CGT CTC CTA AAG GTA TAC CCT TTA ATT GAA CAA ACA 2087 
286 SQSTVVPKRLLKVYPLrEOT 305 

2088 GCA AAG CGA CTC CAT CGT ATT TCT CTA TCA AAA GTT TAC AAC CAT TAT TGC CCA TAT ATT 2147 
^''^'^'^'^LHRISLSKVYNH YCPYI 325 

214 8 GAC ACC CAC GAT GAT GAA AAA ATC CTT AGT TAT TCC TTA AAG CCG AAC CAG GTG TTT GCG 2207 
326 DT HDDEKrLSYSLKPNQVFA 345 

2208 TTT CTT CGA TCC ATT CTT GTT CGA GTG TTT CCT AAA TTA ATC TGG GGT AAC CAA AGG ATA 2267 
^**^^^«SrLVRvFPKLrWGNQRI 365 

2268 TTT GAG ATA ATA TTA AAA G gca ccgca caaaaccca t caccaccaacgacc ccaccag AC CTC GAA ACT 2336 

366 F £ r r L :< D l e r 375 
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2337 TTC TTG AAA TTA TCG AGA TAC GAG TCT TTT AGT TTA CAT TAT TTA ATG AGT AAC ATA AAG 2396 

376 FLKLSRYESFSLHYLMSNIK 395 

23 97 gcaacacgccaaacccttttiaccaccaaccaacaatcag ATT TCA GAA ATT GAA TGG CTA GTC CTT GGA 2465 

396 ISEIEWLVLG 405 

2466 AAA AGG TCA AAT GCG AAA ATG TGC TTA AGT GAT TTT GAG AAA CGC AAG CAA ATA TTT GCG 2525 

406 KRSNAKMCLSDFEKRKQIFA 425 

2526 GAA TTC ATC TAC TGG CTA TAC AAT TCG TTT ATA ATA CCT ATT TTA CAA TCT TTT TTT TAT 2585 

426 EFIYWLYNSFI IPILQSFFY 445 

25 8 6 ATC ACT GAA TCA AGT GAT TTA CGA AAT CGA ACT GTT TAT TTT AGA AAA GAT ATT TGG AAA 264 5 

446 ITESSDLRNRTVYFRKDIWK 465 

2646 CTC TTG TGC CGA CCC TTT ATT ACA TCA ATG AAA ATG GAA GCG TTT GAA AAA -ATA AAC GAG 2705 

J66LLCRPFrTSM:<M£AFEKINE 485 

2706 gcacrctaaagcattcctcgcaaaaagccaacactttcag AAC AAT GTT AGG ATG GAT ACT CAG AAA ACT 2775 

486 NNVRMDTQKT 495 

277 6 ACT TTG CCT CCA GCA GTT ATT CGT CTA TTA CCT AAG AAG AAT ACC TTT CGT CTC ATT ACG 2 83 5 

496 TLPPAVIRLLPKKNTFRLIT 515, 

283 6 AAT TTA AGA AAA AGA TTC TTA ATA AAG gtactaac CCCtggcca tcaacgcact tcscccccaacccatta 2906 

516 NLRKRFLIK 524 

2907 ctagcag ATG GGT TCA AAC AAA AAA ATG TTA GTC AGT ACG AAC CAA ACT TTA CGA CCT GTG 2967 

525 MGSNKKMLVSTNQTLRPV 542 

2 9'6 8 GCA TCG ATA CTG AAA CAT TTA ATC AAT GAA GAA AGT AGT GGT ATT CCA TTT AAC TTG GAG 3027 
543 ASILKH-LI NEESSGIPFN LE 562 

3028 GTT TAC ATG AAG CTT CTT ACT TTT AAG AAG GAT CTT CTT AAG CAC CGA ATG TTT GG gtaac 3088 

563 VYMKLLT FKKDLLKHRMFG 581 

3089 cacacaacgcgcgaccccccaccaccaacttcgcag G CGT AAG AAG TAT TTT GTA CGG ATA GAT ATA 3155 

582 RKKYFVRIDr 591 

3156 AAA TCC TGT TAT GAT CGA ATA AAG CAA GAT TTG ATG TTT CGG ATT GTT AAA AAG AAA CTC 3 215 

592 KSCYOR IKQDLMFRrVKKKL 611 

3216 AAG GAT CCC GAA TTT GTA ATT CGA AAG* TAT GCA ACC ATA CAT GCA ACA AGT GAC CGA GCT 3 275 

612 KDPEFVIRKYATIHATSORA 631 

3276 ACA AAA AAC TTT GTT AGT GAG GCG TTT TCC TAT T gcaagcccacccc cccactggaaccncccaacaa 3343 

532 TKNFVSEAFSYF. 643 

3344 acccCCCttcag TT GAT ATG GTG CCT TTT GAA AAA GTC GTG CAG TTA CTT TCT ATG AAA ACA 3405 

644 DMVP FEKVVQLLSMKT 659 

3 40 6 TCA GAT ACT TTG TTT GTT GAT TTT GTG GAT TAT TGG ACC AAA AGT TCT TCT GAA ATT TTT 3 4 65 

560 S OT LF V O . F VD Y WT KSS SE I ^ ^"^^ 

3 4 66 AAA ATG CTC AAG GAA CAT CTC TCT GGA CAC ATT GTT AAG gcacaccaaccgccgaaccgcaacaaca 3 5 32 
680 KMLKEHLSGH IVK 
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3 53 3 ccaacgaaactag ATA GGA AAT TCT CAA TAC CTT CAA AAA GTT GGT ATC CCT CAG GGC TCA 3 593 

693 IGNSQYLQKVGIPQGS 708 

3594 ATT CTG TCA TCT TTT TTG TGT CAT TTC TAT ATG GAA GAT TTG ATT GAT GAA TAC CTA TCG 3653 

709 I LSSrLCHFYMED LIDEYLS 728 

3654 TTT ACG AAA AAG AAA GGA TCA GTG TTG TTA CGA GTA GTC GAC GAT TTC CTC TTT ATA ACA 3713 

729 FTKKKGSVLLRVVDDFLFIT 748 

3714 GTT AAT AAA AAG GAT GCA AAA AAA TTT TTG AAT TTA TCT TTA AGA G gcgagccgccgccac tcc 3777 

749 VNKKDAKKFLNLSLRG 764 

3778 caagccccaaccgcrgaag GA TTT GAG AAA CAC AAT TTT TCT ACG AGC CTG GAG AAA ACA GTA 3 840 

765 FEKHNFSTSLEKTV 778 

3841 ATA AAC TTT GAA AAT AGT AAT GGG ATA ATA AAC AAT ACT TTT TTT AAT GAA AGC AAG AAA 3900 

779 INFENSNGIINNTFFNESKK 798 

3 901 AGA ATG CCA TTC TTC GGT TTC TCT GTG AAC ATG AGG TCT CTT GAT ACA TTG TTA GCA TGT 3 960 

799 RMPFFGFSVNMRS LDTLLAC 818 

3961 CCT AAA ATT GAT GAA GCC TTA TTT AAC TCT ACA TCT GTA GAG CTG ACG AAA CAT ATG GGG 4 020 

819 PKIDEALFNSTSV E LTKHMG 838 

4021 AAA TCT TTT TTT TAC AAA ATT CTA AG gcacaccgcgtaactgaacaacagccgacaaacaaccag A TCG 4089 

839 KSFFYKILR S 84'8 

4090 AGC CTT GCA TCC TTT GCA CAA GTA TTT ATT GAC ATT ACC CAC AAT TCA AAA TTC AAT TCT 4149 

349 SLASFAQVFIDITHNSKFNS 868 

i 41Si} TGC TGC AAT ATA TAT AGG CTA GGA TAC TCT ATG TGT ATG AGA GCA CAA GCA TAC TTA AAA 4 209 

869 CCNIYRLGYSMCMRAQAYLK 888 

; 4210 AGG ATG AAG GAT ATA TTT ATT CCC CAA AGA ATG TTC ATA ACG G gtgagcaccca cctiTiaaccaga 4274 

■ 889 RMKDIFI PQRMFITO 903 

4275 aaagccaccaaccaaccccag AT CTT TTG AAT GTT ATT GGA AGA AAA ATT TGG AAA AAG TTG GCC 4 339 

904 LLNVIGRKIWKKLA 917 

4340 GAA ATA TTA jGGA TAT ACG AGT AGG CGT TTC TTG TCC TCT GCA GAA GTC AAA TG gcacgcgcc 4401 

918 EILGYTSRRFLSSAEVKW 935 

4402 ggccccgagacLCcagcaacaccgacacaccag G CTT TTT TGT CTT GGA ATG AGA GAT GGT TTG AAA 4468 

936 LFCLGMRDGLK 946 

44 69 CCC TCT TTC AAA TAT CAT CCA TGC TTC GAA CAG CTA ATA TAC CAA TTT CAG TCA TTG ACT 4 528 

947 PSFKYHPCFEQLI YQFQSLT 966 

4 529 GAT CTT ATC AAG CCG CTA AGA CCA GTT TTG CGA CAG GTG TTA TTT TTA CAT AGA AGA ATA 4 588 

967 DLIKPLRPVLRQ. VLFLHRRI ^86 

4589 GCT GAT TAA cgtcacccccaacccaccacac:acaccccct:atic.accggcgccnnaaacaacaccaccaccaagcaca 4665 

987 A D • 989 
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4 666 gccgacccccaaagcaagcacaccacaggacccccagcaaagcaaaaccaaccncgtcaccagccttigaccgacccgtict 4745 

474 6 ctacccntatactcttaagaaagaccgacagtiggticgccgaccactgcccacacgcccaccaaacgggagcggncaaaca 4825 

482 6 ccaaaagcaacacatgaggccaacctccttccatccagaacaaggaaagcggccccccacaacgaacaacgcccgcacca 4905 

4 906 acgcaaaaagacgaagactatctcccaaacaagggggactaagcatatccgaaggaaaagagagcaacacacccagcgcc 498 5 

498 6 gccgaagaaagcaaggacaacccggaacaagccticcgcagatigacaggctiaaatcctggcgaccgaactictggcaaaagc 50 6 5 

506 6 cccaggctacccatggtggccggccctgccaccgagacgaaaagaaaccaaggacagcctgaacaccaacagcncactca 514 5 

514 6 acgccttacacaaggccttgctccctcccgacctcaactittgcatgggcgaaaagaaacagcgtcaagccacnatinggat 5225 

522 6 cccgaaatagccaaacttcctggctccticaaagcggaagcccaaagaacttaccgaagcctatgaggcctcaaaaacncc 5305 

53 06 tcccgatttaaaggaggaatectccaccgacgaggaaacggatagcccaccagcngccgaggagaagcccaacctctcgc 538 5 

53 86 aaaaaagaaaacaccactgggagacatctcctgacgaaccagacgcggagagtacccccagcggatccticgatigccaatia 54 65 

54 6 6 acntccacttctgaaangcacggccccaccgtcgcctcgacctctcgtagctccacgcagctaagcgaccaaaggcacc 554 4 
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1 gcagcgctgc gtcctgctgc gcacgtggga agccctggcc ccggccaccc ccgcgatgcc 
61 gcgcgctccc cgctgccgag ccgtgcgctc cctgctgcgc agccactacc gcgaggtgct 
121 gccgctggcc acgttcgtgc ggcgcctggg gccccagggc tggcggctgg tgcagcgcgg 
181 ggacccggcg gctttccgcg cgctggtggc ccagtgcctg gtgtgcgtgc cctgggacgc 
241 acggccgccc cccgccgccc cctccttccg ccaggtgtcc tgcctgaagg agctggtggc 
301 ccgagtgctg cagaggctgt gcgagcgcgg cgcgaagaac gtgctggcct tcggcttcgc 
361 gctgctggac ggggcccgcg ggggcccccc cgaggccttc accaccagcg tgcgcagcta 
421 cctgcccaac acggtgaccg acgcactgcg ggggagcggg gcgtgggggc tgctgctgcg 
481 ccgcgtgggc gacgacgtgc tggttcacct gctggcacgc tgcgcgctct ttgtgctggt 
541 ggctcccagc tgcgcctacc aggtgtgcgg gccgccgctg taccagctcg gcgctgccac 
601 tcaggcccgg cccccgccac acgctagtgg accccgaagg cgtctgggat gcgaacgggc 
661 ctggaaccat agcgtcaggg aggccggggt ccccctgggc ctgccagccc cgggtgcgag 
721 gaggcgcggg ggcagtgcca gccgaagtct gccgttgccc aagaggccca ggcgtggcgc 
781 tgcccctgag ccggagcgga cgcccgttgg gcaggggtcc tgggcccacc cgggcaggac 
841 gcgtggaccg agtgaccgtg gtttctgtgt ggtgtcacct gccagacccg ccgaagaagc 
901 cacctctttg gagggtgcgc tctctggcac gcgccactcc cacccatccg tgggccgcca 
961 gcaccacgcg ggccccccat ccacatcgcg gccaccacgt ccctgggaca cgccttgtcc 
1021 cccggtgtac gccgagacca agcacttcct ctactcctca ggcgacaagg agcagctgcg 
1081 gccctccttc ctactcagct ctctgaggcc cagcctgact ggcgctcgga ggctcgtgga 
1141 gaccatcttt ctgggttcca ggccctggat gccagggact ccccgcaggt tgccccgcct 
1201 gccccagcgc tactggcaaa tgcggcccct gtttctggag ctgcttggga accacgcgca 
1261 gtgcccctac ggggtgctcc tcaagacgca ctgcccgctg cgagctgcgg tcaccccagc 
1321 agccggtgtc tgtgcccggg agaagcccca gggctctgtg gcggcccccg aggaggagga 
13 81 cacagacccc cgtcgcctgg tgcagctgct ccgccagcac agcagcccct ggcaggtgta 
1441 cggcttcgtg cgggcctgcc tgcgccggct ggtgccccca ggcctctggg gctccaggca 
1501 caacgaacgc cgcttcctca ggaacaccaa gaagttcatc tccctgggga agcatgccaa 
1561 gctctcgctg caggagctga cgtggaagat gagcgtgcgg gactgcgctt ggctgcgcag 
1621 gagcccaggg gttggctgtg ttccggccgc agagcaccgt ctgcgtgagg agatcctggc 
1681 caagttcctg cactggctga tgagtgtgta cgtcgtcgag ctgctcaggt ctttctttta 
1741 tgtcacggag accacgtttc aaaagaacag gctctttttc taccggaaga gtgtctggag 
1801 caagttgcaa agcattggaa tcagacagca cttgaagagg gtgcagctgc gggagctgtc 
1861 ggaagcagag gtcaggcagc atcgggaagc caggcccgcc ctgctgacgt ccagactccg 
1921 cttcatcccc aagcctgacg ggctgcggcc gattgtgaac atggactacg tcgtgggagc 
1981 cagaacgttc cgcagagaaa agagggccga gcgtctcacc tcgagggtga aggcactgtt 
2041 cagcgtgctc aactacgagc gggcgcggcg ccccggcctc ctgggcgcct ctgtgctggg 
2101 cctggacgat atccacaggg cctggcgcac cttcgtgctg cgtgtgcggg cccaggaccc 
2161 gccgcctgag ctgtactttg tcaaggtgga tgtgacgggc gcgtacgaca ccatccccca 
2221 ggacaggctc acggaggtca tcgccagcat catcaaaccc cagaacacgt actgcgtgcg 
2281 tcggtatgcc gtggtccaga aggccgccca tgggcacgtc cgcaaggcct tcaagagcca 
2341 cgtctctacc ttgacagacc tccagccgta catgcgacag ttcgtggctc acctgcagga 
2401 gaccagcccg ctgagggatg ccgtcgtcat cgagcagagc tcctccctga atgaggccag 
2461 cagtggcctc ttcgacgtct tcctacgctt catgtgccac cacgccgtgc gcatcagggg 
2521 caagtcctac gtccagtgcc aggggatccc gcagggctcc atcctctcca cgctgctctg 
2581 cagcctgtgc tacggcgaca tggagaacaa gctgtttgcg gggattcggc gggacgggct 
2641 gctcctgcgt ttggtggatg atttcttgtt ggtgacacct cacctcaccc acgcgaaaac 
2701 cttcctcagg accctggtcc gaggtgtccc tgagtatggc tgcgtggtga acttgcggaa 
2761 gacagtggtg aacttccctg tagaagacga ggccctgggt ggcacggctt ttgttcagat 
2821 gccggcccac ggcctattcc cctggtgcgg cctgctgctg gatacccgga ccctggaggt 
2881 gcagagcgac tactccagct atgcccggac ctccatcaga gccagtctca ccttcaaccg 
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2941 cggcttcaag gctgggagga acatgcgtcg caaactcttt ggggtcttgc ggctgaagtg 
3001 tcacagcctg tttctggatt tgcaggtgaa cagcctccag acggtgtgca ccaacatcta 
3061 caagatcctc ctgctgcagg cgtacaggtt tcacgcatgt gtgctgcagc tcccatttca 
3121 tcagcaagtt tggaagaacc ccacattttt cctgcgcgtc atctctgaca cggcctccct 
3181 ctgctactcc atcctgaaag ccaagaacgc agggatgtcg ctgggggcca agggcgccgc 
3241 cggccctctg ccctccgagg ccgtgcagtg gctgtgccac caagcattcc tgctcaagct 
3301 gactcgacac cgtgtcacct acgtgccact cctggggtca ctcaggacag cccagacgca 
3 361 gctgagtcgg aagctcccgg ggacgacgct gactgccctg gaggccgcag ccaacccggc 
3421 actgccctca gacttcaaga ccatcctgga ctgatggcca cccgcccaca gccaggccga 
3481 gagcagacac cagcagccct gtcacgccgg gctctacgtc ccagggaggg aggggcggcc 
3 541 cacacccagg cccgcaccgc tgggagtctg aggcctgagt gagtgtttgg ccgaggcctg 
3 601 catgtccggc tgaaggctga gtgtccggct gaggcctgag cgagtgtcca gccaagggct 
3661 gagtgtccag cacacctgcc gtcttcactt ccccacaggc tggcgctcgg ctccacccca 
3 721 gggccagctt ttcctcacca ggagcccggc ttccactccc cacataggaa tagtccatcc 
3 781 ccagattcgc cattgttcac ccctcgccct gccctccttt gccttccacc cccaccatcc 
3 841 aggtggagac cctgagaagg accctgggag ctctgggaat ttggagtgac caaaggtgtg 
3 901 ccctgtacac aggcgaggac cctgcacctg gatgggggtc cctgtgggtc aaattggggg 
3 961 gaggtgctgt gggagtaaaa tactgaatat atgagttttt cagttttgaa aaaaa 



FIGURE 17 
HUMAN TRT PROTEIN SEQUENCE 

(SEQ. NO. 2) 

MPRAPRCRAVRSLLRSHYREVLPLATFVRRLGPQGWRLVQRGDP 

AAFRALVAQCLVCVPWDARPPPAAPSFRQVSCLKELVARVLQRLCERGAKNVLAFGF 

LLDGARGGPPEAFTTSWSYLPNTVTDALRGSGAWGLLLRRVGDDVLVHLIiARCALFV 

LVAPSCAYQVCGPPLYQLGAATQARPPPHASGPRRRLGCERAWNHSVREAGVPLGLPA 

PGARRRGGSASRSLPLPKRPRRGAAPEPERTPVGQGSWAHPGRTRGPSDRGFCWSPA 

RPAEEATSLEGALSGTRHSHPSVGRQHHAGPPSTSRPPRPWDTPCPPVYAETKHFLYS 

SGDKEQLRPSFLLSSLRPSLTGARRLVETIFLGSRPWMPGTPRRLPRLPQRYWQMRPL 

FLELLGNHAQCPYGVLLKTHCPLRAAVTPAAGVCAREKPQGSVAAPEEEDTDPRRLVQ 

LLRQHSSPWQVYGFVRACLRRLVPPGLWGSRHNERRFLRNTKKFISLGKHAKLSLQEL 

TWKMS WDCAWLRRS PGVGCVPAAEHRLREE I LAKFLHWLMS VYVVELL^ 

TFQKNRLFFYRKSVWSKLQSIGIRQHLKRVQLRELSEAEVRQHREARPALLTSRLRFI 

PKPDGLRPIVNMDYWGARTFRREKRAERLTSRVKALFSVIJSnfER^ 

LDDIHRAWRTFVLRVRAQDPPPELYFVKVDVTGAYDTIPQDRLTEVIASIIKPQNTYC 

VRRYAWQKAAHGHVRKAFKSHVSTLTDLQPYMRQFVAHLQETS PLRDAWIEQS S SL 

NEASSGLFDVFLRFMCHHAVRIRGKSYVQCQGI PQGS ILSTLIiCSLCYGDMENKLFAG 

IRRDGLLLRLVDDFLLVTPHLTHAKTFLRTLWGVPEYGCVVNLRKTVVNFPVEDEAIj 

ggtafvqmpahglfpwcgllldtrtiievqsdyssyartsirasltfnrgfkagrnmrr 
klfgvlrlkchslfldlqwslqwctniykilllqayrfhacvlqlpfhqqvwknpt 
fflrvisdtaslcysilkaknagmslgakgaagplpseavqwlchqafllkltrhrvt 
yvpllgslrtaqtqlsrklpgttltaleaaanpalpsdfktild 



FIGURE 18 

Clone 712562 
(SEQ ID NO. 3 ) 

GGCCAAGTTCCTGCACTGGCTGATGAGTGTGTACGTCGTCGAGCTGCTCAGGTCTTTCTT 

TTATGTCACGGAGACCACGTTTCAAAAGAACAGGCTCTTTTTCTACCGGAAGAGTGTCTG 

GAGCAAGTTGCAAAGCATTGGAATCAGACAGCACTTGAAGAGGGTGC^GCTGCGGGAGCT 

GTCGGAAGCAGAGGTCAGGCAGCATCGGGAAGCCAGGCCCGCCCTGCTGACGTCCAGACT 

CCGCTTCATCCCCAAGCCTGACGGGCTGCGGCCGATTGTGAACATGGACTACGTCGTGGG 

AGCCAGAACGTTCCGCAGAGAAAAGAGGGCCGAGCGTCTCACCTCGAGGGTGAAGGCACT 

GTTCAGCGTGCTCAACTACGAGCGGGCGCGGCGCCCCGGCCTCCTGGGCGCCTCTGTGCT 

GGGCCTGGACGATATCCACAGGGCCTGGCGCACCTTCGTGCTGCGTGTGCGGGCCCAGGA 

CCCGCCGCCTGAGCTGTACTTTGTCAAGGTGGATGTGACGGGCGCGTACGACACCATCCC 

CCAGGACAGGCTCACGGAGGTCATCGCCAGCATCATCAAACCCCAGAACACGTACTGCGT 

GCGTCGGTATGCCGTGGTCCAGAAGGCCGCCCATGGGCACGTCCGCAAGGCCTTCAAGAG 

CCACGTCCTACGTCCAGTGCCAGGGGATCCCGCAGGGCTCCATCCTCTCCACGCTGCTCT 

GCAGCCTGTGCTACGGCGACATGGAGAACAAGCTGTTTGCGGGGATTCGGCGGGACGGGC 

TGCTCCTGCGTTTGGTGGATGATTTCTTGTTGGTGACACCTCACCTCACCCACGCGAAAA 

CCTTCCTCAGGACCCTGGTCCGAGGTGTCCCTGAGTATGGCTGCGTGGTGAACTTGCGGA 

AGACAGTGGTGAACTTCCCTGTAGAAGACGAGGCCCTGGGTGGCACGGCTTTTGTTCAGA 

TGCCGGCCCACGGCCTATTCCCCTGGTGCGGCCTGCTGCTGGATACCCGGACCCTGGAGG 

TGCAGAGCGACTACTCCAGCTATGCCCGGACCTCCATCAGAGCCAGTCTCACCTTCAACC 

GCGGCTTCAAGGCTGGGAGGAACATGCGTCGCAAACTCTTTGGGGTCTTGCGGCTGAAGT 

GTCACAGCCTGTTTCTGGATTTGCAGGTGAACAGCCTCCAGACGGTGTGCACCAACATCT 

ACAAGATCCTCCTGCTGCAGGCGTACAGGTTTCACGCATGTGTGCTGCAGCTCCCATTTC 

ATCAGCAAGTTTGGAAGAACCCCACATTTTTCCTGCGCGTCATCTCTGACACGGCCTCCC 

TCTGCTACTCCATCCTGAAAGCCAAGAACGCAGGGATGTCGCTGGGGGCCAAGGGCGCCG 

CCGGCC7TCTGCCCTCCGAGGCCGTGCAGTGGCTGTGCCACCAAGCATTCCTGCTCAAGC 

TGACTCGACACCGTGTCACCTAGGTGCCACTCCTGGGGTCACTCAGGACAGCCCAGACGC 

AGCTGAGTCGGAAGCTCCCGGGGACGACGCTGACTGCCCTGGAGGCCGCAGCCAACCCGG 

CACTGCCCTCAGACTTCAAGACCATCCTGGACTGATGGCCACCCGCCCACAGCCAGGCCG 

AGAGCAGACACCAGCAGCCCTGTCACGCCGGGCTCTACGTCCCAGGGAGGGAGGGGCGGC 

CCACACCCAGGCCTGCACCGCTGGGAGTCTGAGGCCTGAGTGAGTGTTTGGCCGAGGCCT 

GCATGTCCGGCTGAAGGCTGAGTGTCCGGCTGAGGCCTGAGCGAGTGTCCAGCCAAGGGC 

TGAGTGTCCAGCACACCTGCCGTCTTCACTTCCCCACAGGCTGGCGCTCGGCTCCACCCC 

AGGGGCAGCTTTTCCTCACCAGGAGCCCGGCTTCCACTCCCCACATAGGAATAGTCCATC 

CCCAGATTCGCCATTGTTCACCCCTCGCCCTGCCCTCCTTTGCCTTCCACCCCCACCATC 

CAGGTGGAGACCCTGAGAAGGACCCTGGGAGCTCTGGGAATTTGGAGTGACCAAAGGTGT 

GCCCTGTACACAGGCGAGGACCCTGCACCTGGATGGGGGTCCCTGTGGGTCAAATTGGGG 

GGAGGTGCTGTGGGAGTAAAATACTGAATATATGAGTTTTTCAGTTTTGOAAAAAAAAAA 

AAAAAAAAAAAAAAAA 
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SEQ ID NO. 10 
MetSerValTyrValValGluLeuLeuArgSerPhePhe 

TyrValThrGluThrThrPheGlnLysAsnArgLeuPhePheTyrArgLysSerValTrp 
SerLysLeuGlnSerlleGlylleArgGlnHisLeuLysArgValGlnLeuArgGluLeu 
SerGluAlaGluValArgGlnHisArgGluAlaArgProAlaLeuLeuThrSerArgLeu 
ArgPhe 1 1 e ProLy s ProAspGlyLeuAr gPr o 1 1 e Va 1 AsnMe t AspTy rVa 1 Va iGly 
Al aArgThr Phe ArgArgGluLy s ArgAl aGluAr gLeuThr S e r ArgVa iLy s Al aLeu 
PheSerValLeuAsnTyrGluArgAlaArgArgProGlyLeuLeuGlyAlaSerValLeu 
GlyLeuAspAspIleHisArgAlaTrpArgThrPheValLeuArgValArgAlaGlnAsp 
ProProProGluLeuTyrPheValLysValAspValThrGlyAlaTyrAspThrllePro 
GlnAspArgLeuThrGluVallleAlaSerllelleLysProGlnAsnThrTyrCysVal 
ArgArgTyrAlaValValGlnLysAlaAlaHisGlyHisValArgLysAlaPheLysSer 
HisValLeiiArgProValProGlyAspProAlaGlyLeuHisProLeuHisAlaAlaLeu 

GlnProValLeuArgArgHisGlyGluGlnAlaValCysGlyAspSerAlaGlyArgAla 
AlaProAlaPheGlyGly 
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SEQUENCE NO. 4 (DNA) AND SEQUENCE NO. 5 (PROTEIN) 
(TRANSLATION OF A A 1 82 hTRT VARIANT) 



1 

met 

GCAGCGCTGCGTCCTGCTGCGCACGTGGGAAGCCCTGGCCCCGGCCACCCCCGCG ATG 

10 

pro arg ala pro arg cys arg ala val arg ser leu leu arg ser 
CCG CGC GCT CCC CGC TGC CGA GCC GTG CGC TCC CTG CTG CGC AGC 

20 30 
his tyr arg glu val leu pro leu ala thr phe val arg arg leu 
CAC TAG CGC GAG GTG CTG CCG CTG GCC ACG TTC GTG CGG CGC CTG 

40 

gly pro gin gly trp arg leu val gin arg gly asp pro ala ala 
GGG CCC CAG GGC TGG CGG CTG GTG CAG CGC GGG GAC CCG GCG GCT 

50 60 . 

phe arg ala leu val ala gin cys leu val cys val pro trp asp 
TTC CGC GCG CTG GTG GCC CAG TGC CTG GTG TGC GTG CCC TGG GAC 

70 

ala arg pro pro pro ala ala pro ser phe arg gin val ser cys 
GCA CGG CCG CCC CCC GCC GCC CCC TCC TTC CGC CAG GTG TCC TGC 

80 90 
leu lys glu leu val ala arg val leu gin arg leu cys glu arg 
CTG AAG GAG CTG GTG GCC CGA GTG CTG CAG AGG CTG TGC GAG CGC 

100 

gly ala lys asn val leu ala phe gly phe ala leu leu asp gly 
GGC GCG AAG AAC GTG CTG GCC TTC GGC TTC GCG CTG CTG GAC GGG 



110 120 
ala arg gly gly pro pro glu ala phe thr thr ser val arg ser 
GCC CGC GGG GGC CCC CCC GAG GCC TTC ACC ACC AGC GTG CGC AGC 




tyr leu pro asn thr val thr 
TAG CTG CCC AAC ACG GTG ACC 

140 

trp gly leu leu leu arg arg 
TGG GGG CTG CTG CTG CGC CGC 

leu leu ala arg cys ala leu 
CTG CTG GCA CGC TGC GCG CTC 

170 

ala tyr gin val -cys gly pro 
GCC TAC CAG GTG TGC GGG CCG 



thr gin ala arg pro pro pro 
ACT CAG GCC CGG CCC CCG CCA 

200 

leu gly cys glu arg ala trp 
CTG GGA TGC GAA CGG GCC TGG 

val pro leu gly leu pro ala 
GTC CCC CTG GGC CTG CCA GCC 

230 

ser ala ser arg ser leu pro 
AGT GCC AGC CGA AGT CTG CCG 



ala ala pro glu pro glu arg 
GCT GCC CCT GAG CCG GAG CGG 

260 

ala his pro gly arg thr arg 
GCC CAC CCG GGC AGG ACG CGT 
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130 

asp ala leu arg gly ser gly ala 
GAC GCA CTG CGG GGG AGC GGG GCG 

150 

val gly asp asp val leu val his 
GTG GGC GAC GAC GTG CTG GTT CAC 

160 

phe val leu val ala pro ser cys 
TTT GTG CTG GTG GCT CCC AGC TGC 

180 

pro leu tyr gin leu gly ala ala 
CCG CTG TAC CAG CTC GGC GCT GCC 

190 

his ala ser gly pro arg arg arg 
CAC GCT AGT GGA CCC CGA AGG CGT 

210 

asn his ser val arg glu ala gly 
AAC CAT AGC GTC AGG GAG GCC GGG 

220 

pro gly ala arg arg arg gly gly 
CCG GGT GCG AGG AGG CGC GGG GGC 

240 

leu pro lys arg pro arg arg gly 
TTG CCC AAG AGG CCC AGG CGT GGC 

250 

thr pro val gly gin gly ser trp 
ACG CCC GTT GGG CAG GGG TCC TGG 

270 

gly pro ser asp arg gly phe cys 
GGA CCG AGT GAC CGT GGT TTC TGT 




val val ser pro ala arg pro 
GTG GTG TCA CCT GCC AGA CCC 

290 

gly ala leu ser gly thr arg 
GGT GCG CTC TCT GGC ACG CGC 

gin his his ala gly pro pro 
GAG CAC CAC GCG GGC CCC CCA 

320 

trp asp thr pro cys pro pro 
TGG GAC ACG CCT TGT CCC CCG 



leu tyr ser ser gly asp lys 
CTC TAG TCC TCA GGC GAC AAG 

350 

leu ser ser leu arg pro ser 
CTC AGC TCT CTG AGG CCC AGC 



glu thr ile phe leu gly ser 
GAG ACC ATC TTT CTG GGT TCC 

380 

arg arg leu pro arg leu pro 
CGC AGG TTG CCC CGC CTG CCC 

leu phe leu glu leu leu gly asn 
CTG TTT CTG GAG CTG CTT GGG AAC 

410 

val leu leu lys thr his cys pro 
GTG CTC CTC AAG ACG CAC TGC CCG 
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280 

ala glu glu ala thr ser leu glu 
GCC GAA GAA GCC ACC TCT TTG GAG 

300 

his ser his pro ser val gly arg 
CAC TCC CAC CCA TCC GTG GGC CGC 

310 

ser thr ser arg pro pro arg pro 
TCC ACA TCG CGG CCA CCA CGT CCC 

330 

val tyr ala glu thr lys his phe 
GTG TAG GCC GAG ACC AAG CAC TTC 

340 

glu gin leu arg pro ser phe leu 
GAG GAG CTG CGG CCC TCC TTC CTA 

360 

leu thr gly ala arg arg leu val 
CTG ACT GGC GCT CGG AGG CTC GTG 

370 

arg pro trp met pro gly thr pro 
AGG CCC TGG ATG CCA GGG ACT CCC 

390 

gin arg tyr trp gin met arg pro 
CAG CGC TAG TGG CAA ATG CGG CCC 

400 

his ala gin cys pro tyr gly 
CAC GCG CAG TGC CCC TAG GGG 

420 

leu arg ala ala val thr pro 
CTG CGA GCT GCG GTG ACC CCA 



430 
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ala ala gly val cys ala arg 
GCA GCC GGT GTC TGT GCC CGG 

440 

ala pro glu glu glu asp thr 
GCC CCC GAG GAG GAG GAC ACA 

leu arg gin his ser ser pro 
CTC CGC CAG CAC AGC AGC CCC 

470 

ala cys leu arg arg leu val 
GCC TGC CTG CGC CGG CTG GTG 

his asn glu arg arg phe' leu 
CAC AAC GAA CGC CGC TTC CTC 

500 

leu gly lys his ala lys leu 
CTG GGG AAG CAT GCC AAG CTC 



met ser val arg asp cys ala 
ATG AGC GTG CGG GAC TGC GCT 

530 

gly cys val pro ala ala glu 
GGC TGT GTT CCG GCC GCA GAG 



ala lys phe leu his trp leu 
GCC AAG TTC CTG CAC TGG CTG 

560 

leu arg ser phe phe tyr val 
CTC AGG TCT TTC TTT TAT GTC 



arg leu phe phe tyr arg lys 
AGG CTC TTT TTC TAC CGG AAG 



glu lys pro gin gly ser val ala 
GAG AAG CCC CAG GGC TCT GTG GCG 

450 

asp pro arg arg leu val gin leu 
GAC CCC CGT CGC CTG GTG CAG CTG 

460 

trp gin val tyr gly phe val arg 
TGG CAG GTG TAC GGC TTC GTG CGG 

480 

pro pro gly leu trp gly ser arg 
CCC CCA GGC CTC TGG GGC TCC AGG 

490 

arg asn thr lys lys phe ile ser 
AGG AAC ACC AAG AAG TTC ATC TCC 

510 

ser leu gin glu leu thr trp lys 
TCG CTG CAG GAG CTG ACG TGG AAG 

520 

trp leu arg arg ser pro gly val 
TGG CTG CGC AGG AGC CCA GGG GTT 

540 

his arg leu arg glu glu ile leu 
CAC CGT CTG CGT GAG GAG ATC CTG 

550 

met ser val tyr val val glu leu 
ATG AGT GTG TAC GTC GTC GAG CTG 

570 

thr glu thr thr phe gin lys asn 
ACG GAG ACC ACG TTT CAA AAG AAC 

580 

ser val trp ser lys leu gin ser 
AGT GTC TGG AGC AAG TTG CAA AGC 




590 

ile gly ile arg gin his leu 
ATT GGA ATC AGA CAG CAC TTG 

ser glu ala glu val arg gin 
TCG GAA GCA GAG GTC AGG CAG 

62 0 

leu thr ser arg leu arg phe 
CTG ACG TCC AGA CTC CGC TTC 



pro ile val asn-met asp tyr 

CCG ATT GTG AAC ATG GAC TAG 

650 

arg glu lys arg ala glu arg 

AGA GAA AAG AGG GCC GAG CGT 



phe ser val leu asn tyr glu 
TTC AGC GTG CTC AAC TAC GAG 

680 

gly ala ser val leu gly leu 
GGC GCC TCT GTG CTG GGC CTG 

thr phe val leu arg val arg 
ACC TTC GTG CTG CGT GTG CGG 

710 

tyr phe val lys val asp val 
TAC TTT GTC AAG GTG GAT GTG 



gin asp arg leu thr glu val 
CAG GAC AGG CTC ACG GAG GTC 
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600 

lys arg val gin leu arg glu leu 
AAG AGG GTG CAG CTG CGG GAG CTG 

610 

his arg glu ala arg pro ala leu 
CAT CGG GAA GCC AGG CCC GCC CTG 

630 

ile pro lys pro asp gly leu arg 
ATC CCC AAG CCT GAC GGG CTG CGG 

640 

val val gly ala arg thr phe arg 
GTC GTG GGA GCC AGA ACG TTC CGC 

660 

leu thr ser arg val lys ala leu 
CTC ACC TCG AGG GTG AAG GCA CTG 

670 

arg ala arg arg pro gly leu leu 
CGG GCG CGG CGC CCC GGC CTC CTG 

690 

asp asp ile his arg ala trp arg 
GAC GAT ATC CAC AGG GCC TGG CGC 

700 

ala gin asp pro pro pro glu leu 
GCC CAG GAC CCG CCG CCT GAG CTG 

720 

thr gly ala tyr asp thr ile pro 
ACG GGC GCG TAC GAC ACC ATC. CCC 

730 

ile ala ser ile ile lys pro gin 
ATC GCC AGC ATC ATC AAA CCC CAG 
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740 750 
asn thr tyr cys val arg arg tyr ala val val gin lys ala ala 
AAC ACG TAG TGC GTG CGT CGG TAT GCC GTG GTC GAG AAG GCC GCC 



his gly his val arg lys ala phe 
CAT GGG GAG GTG GGG AAG GGG TTG 



760 

lys ser his val leu arg pro 
AAG AGC GAG GTG GTA GGT GGA 



770 780 
val pro gly asp pro ala gly leu his pro. leu his ala ala leu 
GTG GGA GGG GAT GGG GGA GGG GTG GAT GGT GTG GAG GGT GGT GTG 



790 

gin pro val leu arg arg his gly glu gin ala val cys gly asp 
GAG GGT GTG GTA GGG GGA GAT GGA GAA GAA GGT GTT TGG GGG GAT 

800 807 
ser ala gly arg ala ala pro ala phe gly gly OP 

TGG GGG GGA GGG GGT GGT GGT GGG TTT GGT GGA TGA TTTGTTGTTGGT 
GAGAGGTGAGGTGAGGGACGGGAAAAGGTTGGTGAGGAGGGTGGTGGGAGGTGTGGGTGA 
GTATGGGTGGGTGGTGAAGTTGGGGAAGAGAGTGGTGAAGTTGGGTGTAGAAGACGAGGG 
GGTGGGTGGGAGGGGTTTTGTTGAGATGGGGGGGGAGGGGGTATTGGGGTGGTGGGGGGT 
GCTGGTGGATAGGGGGAGGGTGGAGGTGGAGAGGGAGTAGTGGAGGTATGGGGGGAGGTG 
GATGAGAGGCAGTGTGAGGTTGAACGGGGGGTTGAAGGGTGGGAGGAACATGGGTGGGAA 
ACTGTTTGGGGTGTTGGGGGTGAAGTGTGAGAGGGTGTTTGTGGATTTGGAGGTGAAGAG 



GGTGGAGAGGGTGTGGAGGAAGATGTAGAAGATGGTGGTGGTGGAGGGGTACAGGTTTGA 



GGGATGTGTGGTGGAGGTGGGATTTGATCAGGAAGTTTGGAAGAAGGGGAGATTTTTGCT 
GGGGGTGATGTGTGAGAGGGGGTGGCTGTGGTAGTGGATGGTGAAAGGGAAGAAGGGAGG 



GATGTCGGTGGGGGGGAAGGGGGGGGGCGGGGGTGTGGCGTGGGAGGGGGTGGAGTGGGT 



GTGCGAGGAAGGATTGGTGGTGAAGGTGAGTGGAGAGGGTGTGAGGTACGTGGGAGTGGT 
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GGGGTCACTCAGGACAGCCCAGACGCAGCTGAGTCGGAAGCTCCCGGGGACGACGCTGAC 

TGCCCTGGAGGCCGCAGCCAACCCGGCACTGCCCTCAGACTTCAAGACCATCCTGGACTG 

ATGGCCACCCGCCCACAGCCAGGCCGAGAGCAGACACCAGCAGCCCTGTCACGCCG 

CTACGTCCCAGGGAGGGAGGGGCGGCCCACACCCAGGCCCGCACCGCTGGGAGTCTGAGG 

CCTGAGTGAGTGTTTGGCCGAGGCCTGCATGTCCGGCTGAAGGCTGAGTGTCCGGCTGAG 

GCCTGAGCGAGTGTCCAGCCAAGGGCTGAGTGTCCAGCACACCTGCCGTCTTCACTTCCC 

CACAGGCTGGCGCTCGGCTCCACCCCAGGGCCAGCTTTTCCTCACCAGGAGCCCGGCTTC 

CACTCCCCACATAGGAATAGTCCATCCCCAGATTCGCCATTGTTCACCCCTCGCCCTGCC 

CTCCTTTGCCTTCCACCCCCACCATCCAGGTGGAGACCCTGAGAAGGACCCTGGGAGCTC 

TGGGAATTTGGAGTGACCAAAGGTGTGCCCTGTACACAGGCGAGGACCCTGCACCTGGAT 

GGGGGTCCCTGTGGGTCAAATTGGGGGGAGGTGCTGTGGGAGTAAAATACTGAATATATG 

AGTTTTTCAGTTTTGAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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RESTRICTION SITES and REGIONS Map of DNA pGRN144.4 

(SEQ. ID. NO. 6) 



3601 ATCGATTGGGCCCGAGATCTCGCGCGCGAGGCCTGCCATGGGACCCACTGCAGGGGCAGC 
TAGCTAACCCGGGCTCTAGAGCGCGCGCTCCGGACGGTACCCTGGGTGACGTCCCCGTCG 

3615 3636 
BGL2 NCOl 

3661 TGGGANGCTGCAGGCTTCAGGTCCCAGTGGGGTTGCCATCTGCCAGTAGAAACCTGATGT 
ACCCTNCGACGTCCGAAGTCCAGGGTCACCCCAACGGTAGACGGTCATCTTTGGACTACA 

3 721 AGAATCAGGGCGCGAGTGTGGACACTGTCCTGAATCTCAATGTCTCAGTGTGTGCTGAAA 
TCTTAGTCCCGCGCTCACACCTGTGACAGGACTTAGAGTTACAGAGTCACACACGACTTT 

3 781 CATGTAGAAATTAAAGTCCATCCCTCCTACTCTACTGGGATTGAGCCCCTTCCCTATCCC 
GTACATCTTTAATTTCAGGTAGGGAGGATGAGATGACCCTAACTCGGGGAAGGGATAGGG 

3 841 CCCCCAGGGGCAGAGGAGTTCCTCTCACTCCTGTGGAGGAAGGAATGATACTTTGTTATT 
GGGGGTCCCCGTCTCCTCAAGGAGAGTGAGGACACCTCCTTCCTTACTATGAAACAATAA 



3901 TTTCACTGCTGGTACTGAATCCACTGTTTCATTTGTTGGTTTGTTTGTTTTGTTTTGAGA 
AAAGTGACGACCATGACTTAGGTGACAAAGTAAACAACC7VAACAAACAAAACAAAAC 



3961 AGCGGTTTCACTCTTGTTGCTCAGGCTGGANGGAGTGCAATGGCGCGATCTTGGCTTACT 
TCGCCAAAGTGAGAACAACGAGTCCGACCTNCCTCACGTTACCGCGCTAGAACCGAATGA 

ALU 

4 021 GCAGCCTCTGCCTCCCAGGTTCAAGTGATTCTCCTGCTTCCGCCTCCCATTTGGCTGGGA 
CGTCGGAGACGGAGGGTCCAAGTTCACTAAGAGGACGAAGGCGGAGGGTAAACCGACCCT 

*************************** ************ 

4081 ttacaggc:acccgccaccatgcccagctaattttttgtatttttagtananacnggggtg 
aatgtccgtgggcggtggtacgggtcgattaaaaaacataaaaatcatntntgnccccac 

A 



4141 GGGGTGGGGTTCACATGTTGGCCAAGCTGGTCTCGAACTTCTGAACTCAGATGATCCANC 
CCCCACCCCAAGTGTACAACCGGTTCGACCAGAGCTTGAAGACTTGAGTCTACTAGGTNG 

LU 



4201 TGCCTCTGCCTCCTAAAATTGCTGGGATTACAGGTGTNANCCACCATGCCCAACTCAAAA 



# 
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ACGGAGACGGAGGATTTTAACGACCCTAATGTCCACANTNGGTGGTACGGGTTGAGTTTT 



4261 TTTACTCTGTTTANAAACATCTGGGTCTAAGGTAGGAANCTCACCCCACTCAATTTTTGT 
AAATGAGACAAATNTTTGTAGACCCAGATTCCATCCTTNGAGTGGGGTGAGTTAAAAACA 

4321 GGTGTTTTTAAGCCAATNANAAAATTTTTTNATGTTGTTTNNimNNNNNl^^ 
CCACAAAAATTCGGTTANTNTTTTAAAAAANTACAACAAANNNNNl^^ 

4381 NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^^ 
NNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

4441 NNNNNNNNNblbn^NNNNNNNin^N^^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNl^^ 

4501 NNimNNNlTONNlSn^NNNNNNiraNNOT^^ 
NNNNNNNNNNNNNNNNNNNNNNNN^ 

4561 NNNNNNNNNNNNNNNNNN^^ 

NNNNNNNNNNNNNNNNNNNiraNNNN^ 

4621 NNNNNNlTONNNNNNNNin^NNin^N^ 
NNNNNNNbnsnSTNNNNNNNNNNNl^^ 

4681 NNNNNNNNNNNNtn^NNNNNN^^ 
NNNNNNNNNNNNNNNNNNl^^ 

4741 NNtmWNNNNNNNNNNN^^ 

NNNNNNNNNNNNNNNITONNNNNNNNNNN^^ 

4801 NNNNNNNNNNNNNNNNNNNNOT^^ 

NNNNNNNNNimNNNNNNNNNNNNNNNNN^ 

4861 NNNNNNNNNNNNNNNNNNNNNN^^ 

4921 NNNNNIn^NNNNNNNNNNN^mNNNNNNNNN^ 

NNNNNN^^^r^INNNNNlmNI^^ 

4981 NimNimNNNNimNNNNNNNNNNNNNNN^ 
NNNNNNNNNNNNNNNNNNNNNNNNNNN^^ 

5041 NGCCANGRAGGGGGCCAGGTTCCAANTTCCCAACCKTTTTWGGARGGAC^ 

NCGGTNCSrrCCCCCGGTCCT^GGTTNAAGGGTTGGMAAAAWCCTYCCTGNCGGGGGTCCC 

5101 GGGGATRAACAGANTNGGGGGKGGTWGGGTTNAKGGTGGGAACNCCTTNGCGCCTGGAG 
CCCCTAYTTGTCTNANCCCCCMCCAWCCCAANTMCCACCCTTGNGGAANCGSCGGACCTC 

5161 AACGTGCAAAGAGGAAATGAAGGGCCTGKGTCAAGGAGCCCAAGTNGGCGGGGRAGTTTG 
TTGCACGTTTCTCCTTTACTTCCCGGAGMCAGTTCCTCGGGTTCANCCGCCCCYTCAAAC 
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5221 CAGGGAGGCACTCCGGGGAGGTCCSGCGTGCCCGTCCAAGGGAGCAATGCGTCCTTCGGG 
GTCCCTCCGTGAGGCCCCTCCAGGSCGCACGGGCAGGTTCCCTCGTTACGCAGGAAGCCC 

52 81 TTCGTCCCCAWGCCGCGTCTACGCGCCTYCCGTCCTCCCCTTCACGTTCCGGCATTCGTG 
AAGCAGGGGTWCGGCGCAGATGCGCGGARGGCAGGAGGGGAAGTGCAAGGCCGTAAGCAC 

5341 GTGCCCGGAGCCCGACGCCCCGCGTCCGGACCTGGAGGCAGCCCTGGGTCTCCGGATCAG 
CACGGGCCTCGGGCTGCGGGGCGCAGGCCTGGACCTCCGTCGGGACCCAGAGGCCTAGTC 

5401 GCCAGCGGCCAAAGGGTCGCCGCACGCACCTGTTCCCAGGGCCTCCACATCATGGCCCCT 
CGGTCGCCGGTTTCCCAGCGGCGTGCGTGGACAAGGGTCCCGGAGGTGTAGTACCGGGGA 

5461 CCCTCGGGTTACCCCACAGCCTAGGCCGGATTCGACCTCTCTCCGCTGGGGCCCTCGCCT 
GGGAGCCCAATGGGGTGTCGGATCCGGCCTAAGCTGGAGAGAGGCGACCCCGGGAGCGGA 

Spl 
******** 

5521 GGCGTCCCTGCACCCTGGGAGCGCGAGCGGCGCGCGGGCGGGGAAGCGCGGCCCATACCC 
CCGCAGGGACGTGGGACCCTCGCGCTCGCCGCGCGCCCGCCCCTTCGCGCCGGGTATGGG 

5581 CCGGGTCCGCCCGGAAGCAGCTGCGCTGTCGGGGCCAGGCCGGGCTCCCAGTGGATTCGC 
GGCCCAGGCGGGCCTTCGTCGACGCGACAGCCCCGGTCCGGCCCGAGGGTCACCTAAGCG 

Topo_II_cl eavage_s i t e 
******************* 

5641 GGGCACAGACGCCCAGGACCGCGCTTCCCACGTGGCGGAAGGACTGGGGACCCGGGCACC 
CCCGTGTCTGCGGGTCCTGGCGCGAAGGGTGCACCGCCTTCCTGACCCCTGGGCCCGTGG 

E2F 
******** 

5701 CGTCCTGCCCCTTCACCTTCCAGCTCCGCTTCTTCCGCGCGGACCCGGCCCCGTCCCGAA 
GCAGGACGGGGAAGTGGAAGGTCGAGGCGAAGAAGGCGCGCCTGGGCCGGGGCAGGGCTT 

E 

**** 

5761 CCCTTCCCAGGTCCCGGCCCAGCCCCTTCCGGGCCCTCCCAGCCCCTCCCCTTCCTTTTC 
GGGAAGGGTCCAGGGCCGGGTCGGGGAAGGCCCGGGAGGGTCGGGGAGGGGAAGGAAAAG 
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Spl 



NFkB h 
******* *.* •it****************it***** 

5821 CGCGGCCCCGCCCTCTCCTTCGCGGCGCGAGTTTCAGGCAGCGCTGCGTCCTGCTGCGCA 
GCGCCGGGGCGGGAGAGGAAGCGCCGCGCTCAAAGTCCGTCGCGACGCAGGACGACGCGT 

5860 5875 
EC04 7III FSPl 

TRT5 ' 

**************************^ 

5881 CGTGGGAAGCCCTGGCCCCGGCCACCCCCGCGATGCCGCGCGCTCCCCGCTGCCGAGCCG 
GCACCCTTCGGGACCGGGGCCGGTGGGGGCGCTACGGCGCGCGAGGGGCGACGGCTCGGC 

5941 TGCGCTCCCTGCTGCGCAGCCACTACCGCGAGGTGCTGCCGCTGGCCACGTTCGTGCGGC 
ACGCGAGGGACGACGCGTCGGTGATGGCGCTCCACGACGGCGACCGGTGCAAGCACGCCG 

5953 
FSPl 



2F 



6001 GCCTGGGGCCCCAGGGCTGGCGGCTGGTGCAGCGCGGGGACCCGGCGGCTTTCCGCGCGC 
CGGACCCCGGGGTCCCGACCGCCGACCACGTCGCGCCCCTGGGCCGCCGAAAGGCGCGCG 

6061 TGGTGGCCCAGTGCCTGGTGTGCGTGCCCTGGGACGCACGGCCGCCCCCCGCCGCCCCCT 
ACCACCGGGTCACGGACCACACGCACGGGACCCTGCGTGCCGGCGGGGGGCGGCGGGGGA 

NFkB 



********************** *****************^f^r ******** 

6121 CCTTCCGCCAGGTGGGCCTCCCCGGGGTCGGCGTCCGGCTGGGGTTGAGGGCGGCCGGGG 
GGAAGGCGGTCCACCCGGAGGGGCCCCAGCCGCAGGCCGACCCCT^CTCCCGCCGGCCCC 

Topo_II_cleavage_s 



NFkB 

Intronl 

******************************************'irifit*****it***-^ 

6181 GGAACCAGCGACATGCGGAGAGCAGCGCAGGCGACTCAGGGCGCTTCCCCCGCAGGTGTC 
CCTTGGTCGCTGTACGCCTCTCGTCGCGTCCGCTGAGTCCCGCGAAGGGGGCGTCCACAG 



ite 
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6241 CTGCCTGAAGGAGCTGGTGGCCCGAGTGCTGCAGAGGCTGTGCGAGCGCGGCGCGAAGAA 
GACGGACTTCCTCGACCACCGGGCTCACGACGTCTCCGACACGCTCGCGCCGCGCTTCTT 

6301 CGTGCTGGCCTTCGGCTTCGCGCTGCTGGACGGGGCCCGCGGGGGCCCCCCCGAGGCCTT 
GCACGACCGGAAGCCGAAGCGCGACGACCTGCCCCGGGCGCCCCCGGGGGGGCTCCGGAA 

6361 CACCACCAGCGTGCGCAGCTACCTGCCCAACACGGTGACCGACGCACTGCGGGGGAGCGG 
GTGGTGGTCGCACGCGTCGATGGACGGGTTGTGCCACTGGCTGCGTGACGCCCCCTCGCC 



6421 GGCGTGGGGGCTGCTGCTGCGCCGCGTGGGCGACGACGTGCTGGTTCACCTGCTGGCACG 
CCGCACCCCCGACGACGACGCGGCGCACCCGCTGCTGCACGACCAAGTGGACGACCGTGC 

64 81 CTGCGCGCTCTTTGTGCTGGTGGCTCCCAGCTGCGCCTACCAGGTGTGCGGGCCGCCGCT 
GACGCGCGAGAAACACGACCACCGAGGGTCGACGCGGATGGTCCACACGCCCGGCGGCGA 

6541 GTACCAGCTCGGCGCTGCCACTCAGGCCCGGCCCCCGCCACACGCTAGTGGACCCCGAAG 
CATGGTCGAGCCGCGACGGTGAGTCCGGGCCGGGGGCGGTGTGCGATCACCTGGGGCTTC 

6601 GCGTCTGGGATGCGAACGGGCCTGGAACCATAGCGTCAGGGAGGCCGGGGTCCCCCTGGG 
CGCAGACCCTACGCTTGCCCGGACCTTGGTATCGCAGTCCCTCCGGCCCCAGGGGGACCC 

6661 CCTGCCAGCCCCGGGTGCGAGGAGGCGCGGGGGCAGTGCCAGCCGAAGTCTGCCGTTGCC 
GGACGGTCGGGGCCCACGCTCCTCCGCGCCCCCGTCACGGTCGGCTTCAGACGGCAACGG 

6721 CAAGAGGCCCAGGCGTGGCGCTGCCCCTGAGCCGGAGCGGACGCCCGTTGGGCAGGGGTC 
GTTCTCCGGGTCCGCACCGCGACGGGGACTCGGCCTCGCCTGCGGGCAACCCGTCCCCAG 

6781 CTGGGCCCACCCGGGCAGGACGCGTGGACCGAGTGACCGTGGTTTCTGTGTGGTGTCACC 
GACCCGGGTGGGCCCGTCCTGCGCACCTGGCTCACTGGCACCAAAGACACACCACAGTGG 

6841 TGCCAGACCCGCCGAAGAAGCCACCTCTTTGGAGGGTGCGCTCTCTGGCACGCGCCACTC 
ACGGTCTGGGCGGCTTCTTCGGTGGAGAAACCTCCCACGCGAGAGACCGTGCGCGGTGAG 

6901 CCACCCATCCGTGGGCCGCCAGCACCACGCGGGCCCCCCATCCACATCGCGGCCA 

GGTGGGTAGGCACCCGGCGGTCGTGGTGCGCCCGGGGGGTAGGTGTAGCGCCGGTGGTGC 

6961 TCCCTGGGACACGCCTTGTCCCCCGGTGTACGCCGAGACCAAGCACTTCCTCTACTCCTC 
AGGGACCCTGTGCGGAACAGGGGGCCACATGCGGCTCTGGTTCGTGAAGGAGATGAGGAG 

7021 AGGCGACAAGGAGCAGCTGCGGCCCTCCTTCCTACTCAGCTCTCTGAGGCCCAGCCTGAC 
TCCGCTGTTCCTCGTCGACGCCGGGAGGAAGGATGAGTCGAGAGACTCCGGGTCGGACTG 

7081 TGGCGCTCGGAGGCTCGTGGAGACCATCTTTCTGGGTTCCAGGCCCTGGATGCCAGGGAC 
ACCGCGAGCCTCCGAGCACCTCTGGTAGAAAGACCCAAGGTCCGGGACCTACGGTCCCTG 



6372 
FSPl 
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7141 TCCCCGCAGGTTGCCCCGCCTGCCCCAGCGCTACTGGCAAATGCGGCCCCTGTTTCTGGA 
AGGGGCGTCCAACGGGGCGGACGGGGTCGCGATGACCGTTTACGCCGGGGACAAAGACCT 

7167 

EC047III 

7201 GCTGCTTGGGAACCACGCGCAGTGCCCCTACGGGGTGCTCCTCAAGACGCACTGCCCGCT 
CGACGAACCCTTGGTGCGCGTCACGGGGATGCCCCACGAGGAGTTCTGCGTGACGGGCGA 

7261 GCGAGCTGCGGTCACCCCAGCAGCCGGTGTCTGTGCCCGGGAGAAGCCCCAGGGCTCTGT 
CGCTCGACGCCAGTGGGGTCGTCGGCCACAGACACGGGCCCTCTTCGGGGTCCCGAGACA 

7321 GGCGGCCCCCGAGGAGGAGGACACAGACCCCCGTCGCCTGGTGCAGCTGCTCCGCCAGCA 
CCGCCGGGGGCTCCTCCTCCTGTGTCTGGGGGCAGCGGACCACGTCGACGAGGCGGTCGT 

7381 CAGCAGCCCCTGGCAGGTGTACGGCTTCGTGCGGGCCTGCCTGCGCCGGCTGGTGCCCCC 
GTCGTCGGGGACCGTCCACATGCCGAAGCACGCCCGGACGGACGCGGCCGACCACGGGGG 

7441 AGGCCTCTGGGGCTCCAGGCACAACGAACGCCGCTTCCTCAGGAACACCAAGAAGTTCAT 
TCCGGAGACCCCGAGGTCCGTGTTGCTTGCGGCGAAGGAGTCCTTGTGGTTCTTCAAGTA 

7501 CTCCCTGGGGAAGCATGCCAAGCTCTCGCTGCAGGAGCTGACGTGGAAGATGAGCGTGCG 
GAGGGACCCCTTCGTACGGTTCGAGAGCGACGTCCTCGACTGCACCTTCTACTCGCACGC 

7561 GGACTGCGCTTGGCTGCGCAGGAGCCCAGGTGAGGAGGTGGTGGCCGTCGAGGGCCCAGG 
CCTGACGCGAACCGACGCGTCCTCGGGTCCACTCCTCCACCACCGGCAGCTCCCGGGTCC 

-A, 

7575 
FSPl 

Intron2 

7621 CCCCAGAGCTGAATGCAGTAGGGGCTCAGAAAAGGGGGCAGGCAGAGCCCTGGTCCTCCT 
GGGGTCTCGACTTACGTCATCCCCGAGTCTTTTCCCCCGTCCGTCTCGGGACCAGGAGGA 



7681 GTCTCCATCGTCACGTGGGCACACGTGGCTTTTCGCTCAGGACGTCGAGTGGACACGGTG 
CAGAGGTAGCAGTGCACCCGTGTGCACCGAAAAGCGAGTCCTGCAGCTCACCTGTGCCAC 



**> 

7741 ATCGAGGTCGACTCTAGAGGATCCCCGGGTACCGAGCTCGAATTCGTAATCATGGTCATA 
TAGCTCCAGCTGAGATCTCCTAGGGGCCCATGGCTCGAGCTTAAGCATTAGTACCAGTAT 

A. 

7747 
SALl 
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FIGURE 23 

EST AA281296 
(Seq. ID. No. 8) 

gc 

caagttcctg cactggctga tgagtgtgta cgtcgtcgag ctgctcaggt ctttctttta 
tgtcacggag accacgtttc aaaagaacag gctctttttc taccggaaga gtgtctggag 
caagttgcaa agcattggaa tcagacagca cttgaagagg gtgcagctgc gggacgtgtc 
ggaagcagag gtcaggcagc atcgggaagc caggcccgcc ctgctgacgt ccagactccg 
cttcatcccc aagcctgacg ggctgcggcc gattgtgaac atggactacg tcgtgggagc 
cagaacgttc cgcagagaaa agagggccga gcgtctcacc tcgagggtga aggcactgtt 
cagcgtgctc aactacgagc gggcgcg 



FIGURE 24 
(Seq. ID. No. 9) 

TCTACCTTGACAGACCTCCAGCCGTACATGCGACAGTTCGTGGCTCACCTGCAGGAG 
ACCAGCCCGCTGAGGGATGCCGTCGTCATCGAGCAGAGCTCCTCCCTGAATGAGGC 
CAGCAGTGGCCTCTTCGACGTCTTCCTACGCTTCATGTGCCACCACGCCGTGCGCAt 
CAGGGGCAAGTC 



